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[57] 



ABSTRACT 



A disc array system has a plurality of disc storage devices 
(i.e., drives) constituting a parity group and mounted in a 
distributed manner on different boards. Where any one drive 
in the^ame parity group develops a fault , the bo ard carrying 
the faulty^drive is wiuSarawn from the mother boardTdr 

j-epairjf tfieCPU issues a write request for data held in the 
faulty drive or in any normal data on the same board, the 
requested data is recovered from the multiple ^u mained 

rinrthe - otherdrivescoj&Ututing thej^e^lo^cal group_and 
mounted-on the other-boardsTThe recovered data is sent to 
the£PJLL— — " — — > 

If the CPU supplies write data to the faulty drive or to any 
normal drive on the same board, the_suppiied . wntejlata is 
Jteldtanporarily in a cache^mejnory in an array contxojlex. 
LaterTTwh^-the-bo^d^ 

mortis toard,_ the ~dataao_be^ held_in jhe alternate drive 
having replaced the faulty drive is recovered from the data 
in the'omeronveV, aM 

altemateioMve^ttej^ memory is 

writtejr^me^aUerna norn^ldrive^ 

42 Claims, 10 Drawing Sheets 
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FIG. 3 
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FIG. 6 




D#1-1 
SD#1 SD#2 

o o 

SD#3 SD#4 
BADR1 





-2 




o 


SD#1 


SD#2 


O 


O 


S0#3 


SD#4 



MM -3 



6 o 

SD#1 SD#2 

o o 

SD#3 SD#4 




O 

SD#1 SD#2 

o o 

S0#3 SD#4 



BADR2 



BADR3 



BADR4 



04/20/2004, EAST Version: 1.4.1 



U.S. Patent Feb. 4, 1997 Sheet 8 of 10 5,600,783 



FIG. 7 



21 LOGICAL GROUP TABLE 



22 

I 5- 


23 






24 


25 


26 

/ 


LOGICAL 
ADDRESS 


logical' 

GROUP No. 


SUB-DATA NAME 


PARITY ' 
DATA NANE 


-1- 


D#1 




D#1-1 


D#1-2 


D#1-3 


P#1 


Spare I 


D#2 




D#2-1 


D#2-2 


D#2-3 


P#2 


Spare2 


D#3 




D#3-1 


D#3-2 


0#3-3 


P#3 


Spare3 


D#4 


LG#1 


D#4-l 


D#4-2 


D#4-3 


P#4 


Spare4 


D#5 




D#5-1 


D#S-2 


D#5-3 


P#5 


Spare5 












































0#1 1 




D#11-1 


0#1 1 -2 


D#11-3 


P#11 


Sparel 1 


D#12 




D#12-1 


D#12-2 


D#12-3 


P#12 


Spare12 


D#13 




D#13-1 


D#13-2 


D#13-3 


P#13 


Sparel 3 


D#14 


LG#2 


D#14-1 


D#14-2 


D#14-3 


P#14 


Sparel 4 


D#15 




0#15-1 


0#15-2 


D#15-3 


P#15 


Sparel 5 































04/20/2004, EAST Version: 1.4.1 



U.S. Patent 



Feb. 4, 1997 



Sheet 9 of 10 



5,600,783 




04/20/2004, EAST Version: 1.4.1 



U.S. Patent Feb. 4, 1997 Sheet 10 of 10 5,600,783 



FIG. 9 

31 ADORESS TABLE 



P 27 y 29 ;30 27 100 ^ f 2 9 Z 32 









^ \^ \^ SCSI DRIVE ADDRESS J / \ 




LOGICAL 


LOGICAL 




DATA ^ Ny 


/ [ PARITY ] \ 


INTER- 


GROUP NO 


ADDRESS 


BOARD 
ADDRESS 


BOARD 
WITH- 
DRAWAL 


DRIVE 
No. 




BOARD 
ADDRESS 


BOARD 

WTTM- 
Ri I n 

DRAWAL 


DRIVE 
No. 


PAIN T 


LFnlVC 

ADDRESS 




on 


BADR1 


0 


















m 


BADR2 


0 


SD#I 


0 


BADR4 


0 


S0#1 


0 


SADR1 




Die 


BADR3 


0 


















0#4 


BADR2 


0 


















DI5 


BADR3 


0 


SDfl 


0 


BADR1 


0 


SD#1 


0 


SADR2 




U86 


BADR4 


0 


















0#7 


BADR3 


0 


















m 


BADR4 


0 


SDtl 


0 


BADR2 


0 


S0#1 


0 


SADR3 




m 


BADR1 


0 


















0#I0 


BADR4 


0 
















i_urr i 




BADR1 


0 


SDfl 


0 


BADR3 


0 


SD#1 


0 


SA0R4 




0#12 


BADR2 


0 


















D#13 


BADR1 


0 


















Wf14 


BADR2 


0 


SD#2 


0 


BADR4 


0 


S0#2 


0 


SADRI 




D#15 


BADR3 


0 


















D#16 


BADR2 


0 


















D#17 


BADR3 


0 


SD#2 


0 


BADR1 


0 


S0#2 


0 


SADR2 




0S18 


BADR4 


0 


















0#16 


BA0R3 


0 


















0#17 


BADR4 


0 


SKf2 


0 


BADR2 


0 


SD#2 


0 


SADR3 




0#18 


8ADR1 


0 


















D#16 


BADR4 


0 


















0#17 


BADR1 


0 


SD#2 


0 


8ADR3 


0 


SD#2 


0 


SADR4 




0516 


BADR2 


0 




















i 



















04/20/2004, EAST Version: 1.4.1 



5,600,783 



DISC ARRAY SYSTEM HAVING DISC 
STORAGE DEVICES DISPERSED ON 
PLURAL BOARDS AND ACCESSIBLE AT 
WITHDRAWAL OF PART OF THE BOARDS 



BACKGROUND OF THE INVENTION 

ITie present invention relates to a disc array system 
comprising a plurality of disc storage devices that retain a 
plurality of data together with error correction codes. 

In existing computer systems, the data needed by the CPU 
or by the similar high-order facility is held in a secondary 
storage device. When necessary, the CPU or its equivalent 
accesses the secondary storage device to write or read data 
thereto or therefrom. The secondary storage device gener- 
ally utilizes a nonvolatile storage medium and typically 
constitutes a magnetic disc unit or an optical disc unit. Such 
disc units will each be referred as a disc drive hereunder. 

With today's society getting increasingly information- 
oriented, computer systems require secondary storage 
devices of correspondingly enhanced performance. One 
proposed solution to this challenge is a disc array setup 
composed of numerous disc drives of a relatively small 
capacity each. 

The solution is discussed illustratively by D. Patterson, G 
Gibson and R. H. Kartz, "A Case for Redundant Arrays of 
Inexpensive Discs (RAID)" ACM SIGMD Conference, 
Chicago, HI, (June 1988), pp. 109-116 (hereinafter referred 
to as the first prior art). The paper reports on the performance 
and reliability of disc arrays (at level 3) for dividing data and 
parallelly processing the data divisions and of disc arrays (at 
levels 4 and 5) for distributing and independently handling 
data. 

The disc arrays at level 3 divide one ^!e~dataltem 
tMsferred ixom the CPU in connection 'with a write request, 
and generate parity data for error correction based on the 
* divided data. Thc o^ (uvisions^and the parity^aata_are 
4 stored parallelly m'afplurah^.of disc drives. When thrGFJU 
orders a read operation of the di vided 7data v ^th1s^ta^ivisioxis. 
lare.fead parallelly" from the appropriate disc drives and are 
combined for transfer to the TGPU. 

The disc arrays at levels 4 and 5 differ from those at level 
3 in the following two aspects: that the divided data at level 
3 are replaced by a plurality of data items that are used in 
connection with a plurality of write requests transferred 
from the CPU; and that the data are read in units of data 
items and, unlike at level 3, are not combined for transfer to 
the CPU. 

The multiple data items and the error correction data 
generated from such data items are called an error correction 
data group, a parity group or a parity data group. Because the 
error correction data may be constituted not only by parity 
data but also by various other kinds of error correction data, 
the wording "parity group" is used in this specification even 
where the error correction data may not be parity data. 
Although the error correction codes in this specification are 
composed of parity data, this invention may also be applied 
to cases where the error correction codes are something 
other than the parity data. 

In the description that follows, a plurality of disc drives 
accommodating the data of the same parity group are 
collectively called a logical group. The logical group con- 
stitutes a fault recovery unit. If a discjlnvepf a logical group 
CbecomeTfaulty, the faiuTcan be^addressed using the data in 
■ me.omer^disc^veTwithin the^same logical group. Dlus- 
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tratively, if a disc drive holding divided data has developed 
a fault and cannot be accessed for reading~data therefrom; 
r the data in the faulty disc drive c^ be'recohlu^te^by use 
of the data and parityjlata held in the remainihg"disc drives 
of the samelogical group; In systems comprising numerous 
disc drives such as disc array systems, growing numbers of 
component parts tend to increase the probability of fault. It 
is for the purpose of enhancing the reliability of disc drives 
that the parity scheme is employed for recovery from fault. 

Also proposed is a scheme whereby aHUata~destined,to.a 
faulty drive is. written to a spare drive*until~the~faulry drive 
Hs-replaced'by jn^ aheraaterdrive; One such scheme is 
described in U.S. Pat. No. 5,077,736 (called the second prior 
art hereunder). 

With conventional disc array systems of level 3 (called the 
third prior art hereunder), a plurality of disc drives are 
connected by cables to an array controller. In this setup, the 
drives are connected to the array controller using as many 
buses as the number of drives constituting each different 
logical group. That is, the first drive of each of the different 
logical groups is connected to the array controller via a 
common first bus, by use of cables such that the first drives 
make up a daisy chain when connected. The second drive of 
each different logical group is connected likewise to the 
array controller via a common second bus, using cables such 
that the second drives also constitute a daisy chain. The other 
drives are connected in like manner. Where numerous drives 
make up the disc array system, the cable-based drive con- 
nection is not very desirable because of floor space con- 
straints and of reduced maintainability. What is desired then 
is a setup having many small drives installed therein. 

Japanese Patent Laid-open No. Hei 3-108178 (corre- 
sponding to U.S. patent application Ser. No. 409495, filed on 
Sep. 19, 1989) discloses a technique whereby a plurality of 
drives each having numerous pins at the bottom are mounted 
on a semiconductor board (the technique may be called the 
fourth prior art hereunder). The disclosed technique is aimed 
at mounting numerous small drives on the board. However, 
it is not disclosed whether or how these drives constitute 
logical groups. 

Japanese Patent Laid-open No. Hei 4-228153 (corre- 
sponding to U.S. patent application Ser. No. 502215, filed on 
Mar. 30, 1990) discloses a technique whereby a plurality of 
disc drives are mounted detachably on each of a plurality of 
boards having common wiring, the drives on one board 
being called a disc array (the technique may be called the 
fifth prior art hereunder). The multiple disc drives of each 
array belong to the same logical group. If any one drive on 
a given board develops a fault, the board carrying the faulty 
drive remains inserted in the system and the faulty drive is 
withdrawn from the board to be replaced by an alternate 
drive. In this case, the faulty drive remains accessible. 
Specifically, if a read request is generated to read data from 
the faulty drive, the data and parity data held in the other disc 
drives of the same logical group are used to provide the 
target data to be read. 

SUMMARY OF THE INVENTION 

With the fifth prior art outlined above, the arrays each 
comprising a plurality of disc drives are mounted on a 
common mother board so that numerous small drives are 
installed at high density. However, if any one drive develops 
a fault, the faulty drive must be withdrawn from the appli- 
cable board for replacement by an alternate drive. To with- 
draw any faulty drive requires providing a space wide 
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enough for a user to gain manual access to that drive. That 
space requirement poses limits to arranging numerous drive- 
carrying boards at short intervals. 

It is therefore an object of the present invention to provide 
a disc array system having a plurality of disc drives mounted 
on a plurality of boards and getting those boards arranged on 
a commons-mother board at relatively short intervals 
_ wherein/if a fault occursin anyHisc drive/the data held in 
the faulty drive may still be accessed by an upper device. 

In carrying out the invention and according to one aspect 
thereof, there is provided a disc array system comprising: a 
mother board; a plurality of boards mounted detachably on 
one side of the mother board and arranged substantially in 
parallel; a plurality of disc drives mounted on one side of 
each of the boards; and an array controller for controlling the 
writing and reading of data to and from disc drives on the 
boards via signal lines installed on the mother board; the 
array controller including: a write circuit for writing a 
plurality of data items and error correction codes used in 
connection with these data items to a plurality of disc drives 
each mounted on a different board; a read circuit used when 
a board carrying a faulty disc drive is withdrawn from the 
mother board and when an upper device accesses any disc 
drive on the withdrawn board for reading a target data item 
therefrom, the accessed drive being either the faulty disc 
drive or a normal drive, the read circuit reading a plurality 
of data items belonging to the same error correction data 
group as that of the target data item together with the parity 
data in connection with the read data items, from a plurality 
of disc drives mounted on a plurality of boards other than the 
withdrawn board; and a recovery circuit for recovering the 
target data item from the read data items and from the parity 
data read in connection therewith. 

In a preferred s^cture accardmg-to-the^invention, the 
"disc array system further comprises a random access 
memory for temporarily accommodating write data supplied^ 
by the upper device for a write operation to any one of the 
disc drives-mounted on the withdrawn board carrying the 
faulty disc drive; and a write circuit used when the board 
having an alternate drive mounted thereon for replacing the 
faulty disc drive and readied for data write operations is 
again mounted on the mother board,%ie write rircuit writing* 
tothealternate drive the data destined Yo the faulty disc drive 
but held in the random access jnemory, me write circuit also^ 1 
writing to any normal drive on the replaced board the data 
destined thereto but held in the random access memory, 
while the board is being withdrawn from the mother board 
for replacement of the faulty disc drive. 

In another preferred structure according to the invention, 
the disc array system further comprises: at least as many, 
spare drives as the number of data-holding disc drives 
mounted on a single board separate from the boards carrying 50 
the disc drives for holding data;>a write circuit used, when 
the board carrying the faulty. (Msfc. drive is wimdrawn from 
^the mother boardV for selecting one spare drive with respect" 

, to each of the^disc drives mounted on the vrithdrawn.board,~ 

"the write circuit being further r usedrwHen the upper device 55 
supplies data to be written to any target disc drive mounted 
on the withdrawn board, for writing the supplied data to the 
spare drive selected with respect to the target disc drive; and 
a write circuit used when the board having an alternate drive 
mounted thereon for replacing the faulty disc drive and 
readied far data write operations is again mounted on the 
mother board, the write circuit writing the data destined to 
the faulty disc drive to the spare drive selected with respect 
to the alternate drive. 

Other objects, features and advantages of the present 65 
invention will become apparent in the following specifica- 
tion and accompanying drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a schematic view of a disc array system practiced 
as a first embodiment of the invention; 

FIG. 2A is a view showing the arrangement of boards in 
the first embodiment of FIG. 1; 

FIG. 2B is a view depicting connectors mounted on the 
mother board of the first embodiment; 

FIG. 3 is a detailed circuit diagram of circuits in the first 
embodiment; 

FIG. 4 is a view of a typical logical group table for use 
with the circuits of FIG. 3; 

FIG. 5 is a view of a typical address table for use with the 
circuits of FIG. 3; 

FIG. 6 is a view illustrating how the first embodiment 
writes data; 

FIG. 7 is a view of a typical logical group table used by 
a disc array system practiced as a second embodiment of the 
invention; 

FIG. 8 is a view of a typical address table for use with the 
second embodiment; and 

FIG. 9 is a view of a typical address table used by a disc 
array system practiced as a third embodiment of the inven- 
tion. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

(First embodiment) 
(1) Outline of the system 

FIG. 1 is a schematic view of a disc array system practiced 
as the first embodiment of the invention. In FIG. 1, the disc 
array system comprises a plurality of disc drives 5 and an 
array controller 2 for controlling the operation of these disc 
drives. 

Reference numeral 1 represents" a CPU that orders the 
writing and reading of "data externally; The CPU 1 and the 
array controller 2 are connected by a plurality of channel 
paths 7. The array controller 2 includes: a plurality of (e.g., 
4) channel path director boards 8A carrying distributed 
channel path directors 8 for selecting channel paths 7; a 
plurality of (e.g. # 4) channel path interface boards 3A 
carrying channel interface circuits 3 for controlling data 
transfer to and from the CPU 1; a plurality of (e.g., 4) cache 
memory boards 4A carrying distributed cache memories 4 
which are nonvolatile semiconductor memories backed up 
by batteries; and a bus 37 for connecting the cache memory 
boards 4A with the channel interface boards 3A. The bus 37 
is also connected to a plurality of (e.g., 4) drive boards 5A 
each retaining a plurality of disc drives 5 and a drive 
interface circuit 20. 

FIG. 2A illustrates the arrangement of boards in the first 
embodiment of FIG. 1. As shown, the boards described 
above are mounted on a common mother board 37. In FIG. 
2A, rectangles 370 indicated on each board represent an LSI 
chip each. It should be noted that only a part of the 
rectangles 370 on the drive board 5A are LSI chips; the 
remaining majority 5 are the disc drives mounted on that 
board. The mother board 37 comprises board connector 
groups 30A, 40A, 50A and 80A indicated in FIG. 2B. It is 
by means of these connectors that the boards are inserted 
detachably in the mother board. The boards are arranged 
substantially in parallel with one another. 

The surface of the mother board supports the bus 37 for 
connecting the channel interface connector group 30A, the 
cache memory connector group 40A, and the drive board 
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connector group 50A. Also provided are signal lines 12 
(FIG. 3) for connecting the channel director connector group 
80A with the channel interface connector group 30A. 

One characteristic of the first embodiment lies in the fact 
that, when a fault has occurred in a disc drive on a drive 5 
board, that drive board together with its normal drives is 
withdrawn from the mother board 37 (i.e., the faulty drive 
not detached from the drive board) for replacing the faulty 
drive with an alternate drive, the drive board along with the 
alternate drive being replaced onto the mother board 37. The 
point is that if any disc drive has developed a fault, the drive 
board carrying the faulty drive need only be withdrawn from 
the mother board and inserted therein again after replace- 
ment of the faulty drive. This makes it possible to reduce the 
spacing between drive boards when they are mounted. 
Another characteristic of the first embodiment is that a 15 
plurality of disc drives mounted in a distributed manner on 
a plurality of drive boards constitute a logical group. Thus 
when any drive board is withdrawn from the mother board, 
the data held in any of the disc drives on the detached board 
may be recovered from the data and parity data held in the 20 
other drives mounted on the other drive boards within the 
same logical group. 

The logical group 9 is made up of m drives ,5 (illustra- 
tively four drives in the first embodiment) mounted on a 
plurality of (e.g., 4) boards. The logical group 9 is a faults 25 
recovery unit comprising the disc drives 5 that retain m-1 
error correction data groups (i.e., parity groups) composed \ 
of data and the parity data derived therefrom. 
- Although the first embodiment comprises four channel 
interface boards 3 and four drive interface boards 6, the 
numbers of these components are not limitative of the 
invention. Likewise, the number of disc drives 5 mounted on 
each board and the number of disc drives 5 constituting one 
logical group 9 are not limitative of the invention; these 
counts may be selected as desired. 

FIG. 3 is a detailed circuit diagram of each board in the 35 
first embodiment. The channel path directors 8 include 
interface adapters (IF Adp) 10 connected to the respective 
channel paths 7 and a channel path switch 11. Although the 
channel path directors 8 are actually mounted in a distrib- 
uted manner on four boards, FIG. 3 provides a compreben- 40 
sive diagram of the entire circuitry constituting the directors 
8. 

The channel interface circuits 3 each comprise: a channel 
interface (CH IF) 13 for effecting protocol conversion and 
transfer rate adjustment regarding the data transferred from 45 
the CPU 1; a data control circuit (DCC) 14 for controlling 
data transfer; a channel-side cache adapter (C Adp) 15 for 
controlling the writing of data sent from the CPU 1 into the 
cache memory 4; a parity generator 18 for generating parity 
data from the divided data written to the cache memory 4; 50 
and a microprocessor (MP) 17 for controlling the configured 
circuits. The channel interfaces 13 are connected to the 
channel path switch 11 via data lines 12. The cache adapters 
15 control the bus 37 as well as the transfer of data between 
the CPU 1 and the cache memories 4. 55 

The drive board 5A includes a drive-side cache adapter (C 
Adp) 19 and a drive interface circuit (Drive IF) 20. Refer- 
ence numeral 16 represents control signal lines that connect 
the channel path switch 11, the channel interfaces 13, the 
data control circuits 14, the channel-side cache adapters 15, 60 
the microprocessors 17, the cache memories 4, the drive- 
side cache adapters 19 and the drive interface circuits 20. 
The bus 37 comprises a control bus 37A for transferring 
control information and a data bus 37B for transferring data. 

In FIGS; 1- and 3,.the board 50AJs for use withtfie second 65 
embodiment to=be de^ibed^later^That is, the board 50& 
carries"spare.driyes V i5. ^ 
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(2) Outline of the system operation in the absence of 
faulty disc drives 

The internal workings of the first embodiment will now be 
described with reference to FIG. 3. The first embodiment 
works primarily on the so-called RAID 3 control principles. 

The CPU 1 first issues a read or write command. The 
command is input to a channel path director 8 in the array 
controller 2 via the channel paths 7. 

Given the command from the CPU 1, the channel path 
director 8 in the array controller 2 checks to see if the 
command is acceptable. When the command is found to be 
acceptable, that command sent from the CPU 1 to the array 
controller 2 is input to the interface adapter (IF Adp) 10. The 
microprocessor (MP) 17 performs command acceptance 
processing. The first embodiment writes and reads data 
primarily in accordance with the conventional RAID level 3 
scheme. 

FIG. 6 depicts how the first embodiment writes data as per 
the RAID level 3 scheme. In FIG. 6, reference characters 
BADRi (i=l, 2, 3 or 4) represent the board addresses of four 
drive boards 5 A. Reference characters SD#1 (i=l, 2, 3 or 4) 
denote the drive address specifying one disc drive on each 
of the boards. 

In the description that follows, the board whose board 
address is, say, BADRI is called the board BADRI. Like- 
wise the drive whose drive address is SD#1 on a board is 
called the drive SD#1. . 

The drives SD#1 on the boards BADRI through BADR4) 
are assumed to constitute a logical group. It is also assumed 
that the disc drives on the board BADR4 are used to store 
parity data and that the disc drives SD#1 on the other boards 
BADRI through BADR3 are used to accommodate data. 

It is further assumed that the command from the CPU 1 
is a data write command and that the write data sent from the 
CPU 1 is represented by D#l in FIG. 6. D#l is a logical 
address that specifies the write data. The data specified by 
the logical address D#l will be called the data D#l here- 
under. 

With the first embodiment, the length of the data written 
or read by the CPU 1 is assumed to be fixed at 4 KB. Thus 
the write data D#l from the CPU 1 is 4 KB long. It should 
be noted that the data length is not limitative of the inven- 
tion; any other appropriate data length may be adopted for 
the invention. 

The write data D#l from the CPU 1 is held in the cache 
memories 4 in FIG. 1. Under the RAID 3 scheme, the write 
data D#l thus accommodated is divided into single-byte 
data units from the beginning of the data, and the data units 
are distributed to the three data storage drives (SD#1 on 
BADRI, BADR2 and BADR3) out of the four constituting 
the logical group. The divided data units are called sub data 
having sub data names D#l-1, D#l-2 and D#l-3. The sub 
data whose sub data name is D#l-1 will be called the sub 
data D#l-1 hereunder. Parity data is generated from the sub 
data following the division. Parity data generation involves 
processing the corresponding bytes of the sub data. Specifi- 
cally, the first byte of the sub data D#l-1, the first byte of the 
sub data D#l-2 and the first byte of the sub data D#l-3 are 
computed to give a parity data item; the second byte of the 
sub data D#l-1, the second byte of the sub data D81-2 and 
the second byte of the sub data D#l-3 are computed to 
furnish another parity data item; and so on. The parity data 
items thus generated are arranged consecutively to constitute 
parity data P#l. 

The sub data D#l-1, D#l-2, D#l-3 and the parity data P#l 
are written in a distributed manner to the four disc drives 
making up the logical group. The data from the CPU 1 is 
written in the above-described manner. 
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When the CPU 1 issues a data read request, the logical 
group holding the requested data is accessed for reading 
therefrom the sub data constituting the target data. These sub 
data arc combined and transmitted via the cache memories 
4 to the CPU 1. 5 

(3) Address translation table 

The address translation table necessary for the above- 
described writing and reading of data will now be described 
in detail. 

For the disc drives 5 constituting the disc array system, 
the first embodiment adopts SCSI interface drives. In addi- 
tion, the CPU 1 of the first embodiment issues write and read 
commands without getting aware of disc arrays. That is, the 
CPU 1 is not aware whether the disc drives to which to issue 
commands are of conventional type or are part of a disc 
array system. This is because the disc drives widely used 15 
today are of conventional type so that the operating systems 
(OS) are not designed to be aware of disc arrays. Thus with 
the first embodiment, the CPU 1 operates on the assumption 
that it writes and reads data to and from conventionally 
configured disc drives; the disc array system carries out its 20 
own processing associated with disc arrays. For this reason, 
the CPU 1 issues write and read commands using a con- 
ventional interface arrangement. 

More specifically, when the CPU 1 sends a data write 
command to the array controller 2, the CPU 1 transfers the 25 
target write data together with its logical address (data name 
or data number). The array controller 2 divides the write data 
into sub data as described with reference to FIG. 6, and 
stores these sub data as well as the associated parity data to 
the disc drives of the logical group 9. At this time, the sub 30 
data and parity data are given addresses (D#l-1 through 
D#l-3, P#l, etc. as explained by referring to FIG. 6) for 
processing by the array controller 2. The logical address 
furnished by the CPU 1 and the addresses prepared by the 
array controller 2 are matched in what is known as the 35 
address translation table, to be described below. 

When the CPU 1 issues a data read command to the array 
controller 2, the CPU 1 furnishes the array controller 1 with ' 
the applicable logical address for a read operation by the 
latter. Using the address translation table, to be described 40 
below, the array controller 2 translates the furnished logical 
address into appropriate addresses used within the disc array 
system. These addresses are referenced in order to read the 
sub data which are then combined and transferred to the 
CPU 1. 45 

The address translation table is constituted by a logical 
group table 21 and an address table 31. 

FIG. 4 illustrates a typical structure of the logical group 
table 21. Logical addresses 22 are each an address desig- 
nated by the CPU 1 (i.e., data name or data number). Sub 50 
data names 24 are acquired by dividing data having a given 
logical address 22. Parity data names 25 are prepared on the 
basis of the corresponding sub data. Logical group numbers 
23 are each the number of a logical group 9 that currently 
stores or is expected to store the sub data and parity data. 55 

In an initial state of the address translation table in which 
no data is written, the column of logical group numbers 23 
has predetermined logical group numbers set therein as 
initial values. In the initial state, the columns of logical 
addresses 22, of sub data names 24 and of parity data names 60 
25 are all left blank in the table. 

FIG. 5 depicts a typical structure of the address table 31. 
The address table 31 retains detailed address information 
about what data has been (or is to be) written in which 
location of which drive belonging to which logical group. 65 

Logical addresses 22 in the address table 31 of FIG. 5 are 
the same as the logical addresses 22 in the logical group 
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table 21 of FIG. 4. These are logical addresses (data names 
or data numbers) each designated by the CPU 1. Logical 
group numbers 23 in the address table 31 of FIG. 5 are the 
same as the logical group numbers 23 in the logical group 
table 21 of FIG. 4. Reference numeral 30 represents SCSI 
drive address information about the SCSI drive to which 
data of the corresponding logical addresses has been (or is 
to be) written. 

The SCSI drive address information 30 comprises four 
columns of address information. Each column includes a 
board address 27, drive numbers 28, fault flags 29 and a 
board withdrawal flag 100. The board withdrawal flag 100 
indicates whether or not the board having the listed address 
is withdrawn from the mother board. Reference numeral 32 
denotes inter-drive addresses each indicating what data has 
been (or is to be) written in which location of which drive. 
A board address 27, a drive number 28 and an inter-drive 
address 32 corresponding to a given logical address 22 
combine to denote the location to which the data at that 
logical address 22 (sub data and parity data) has been (or is 
to be) written in which drive on which board. 

The board withdrawal flag 10 for each board is set to 1 
when that board is withdrawn from the mother board 37. 
When withdrawing a board, the user gets the CPU 1 to issue 
to the array controller 2 a command indicating that the board 
has been withdrawn. Responding to the issued command, 
the microprocessor 17 sets the board withdrawal flag 10 
corresponding to the indicated board. Later, when the 
detached board is replaced onto the mother board, the user 
also gets the CPU1 to notify the array controller 2 using a 
conunand that the board in question has been replaced. In 
response, the array controller 2 performs recovery process- 
ing, to be described later, and resets the board withdrawal 
flag-100-corresponding-to the replaced board, 

The fault .flag^mdicates whether, or jot the correspond-^ 
ing disc drive has. developed a fault That is, a]fau)t flag ?9 
remains off (at 0) when everything is normal, and is turned 
on (i.e., set to 1) if^jcorresponding disc drive on the 
corresponding board has developed a fault and is inacces- 
sible for read or write operations "thereto? — - 

The four columns under the SCSI drive address informa- 
tion 30 correspond respectively to the columns under the sub 
data name 24 and to the column under the parity data name 
25 in the logical group table 21 of FIG. 4. 

Because the first embodiment has its data stored in 
parallel in a plurality of regions having the same inter-drive 
address 32, it is preferable to synchronize the revolutions of 
all disc drives 5 within the same logical group 9. 

As described, the first embodiment reads and writes data 
using the address table 31 and logical group table 21. 

(4) System operation upon disc drive fault 

With the first embodiment, as shown in FIG. 6, each 
logical group 9 comprises a plurality of disc drives mounted 
on a plurality of boards (e.g., BADR1, BADR2, BAJDR3, 
B ADR4). The address translation tables depicted in FIGS. 4 
and 5 indicate which disc drives on which boards constitute 
what logical group, the indication being given by having the 
address of each disc drive supplemented with the applicable 
board address. 

Suppose that the first embodiment attempts a number of. 
times to access a disc drive on, say, the board BADR1 for 
writing or reading data thereto or therefrom. If a predeter- 
mined number of retries turn out to be unsuccessful, the 
microprocessor 17 recognizes a fault of the board BADR1 
and, in the address table 15, turns on (i.e., sets to 1) the fault 
flag 29 corresponding to the disc drive SD#1 on the board 
BADR1. With the disc drive SD#1 in defective state, the disc 
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array system remains operational even if the mother board 
37 still retains the board BADR1 carrying that faulty drive 
or if the defective board is detached from the mother board 

37 for repair. _____ _____ 

Specifically,' when the CPU 1 issues a data reaTrequesiD 5 
pertaining to data held in any of the disc drives SD#lon the 
-boards BADR1-BADR4, the microprocessor 17 checks the 
address" table 15. In the manner described in connection with 
the address translation table, the microprocessor 17 trans- 
lates the logical address designated by the CPU 1 into the 10 
address applicable within the~disc array system.- At_this 
pqint^thenu^roprocessor.lXchecks^ flag 
29 corresponding to Jhe disc drive SD#1 on the board 
(BADR1 in the above example) to which the "translated 



the disc drive SD#1 has developed a fault and that the board 
BADR1 carrying that drive^SD#l„stays inserted_in_the 
mother board 37. In that case, the microprocessor 17 accom- 
modates the write data inside the cache memories 4. 

Latere when the disc drive SD#1 on the detachedboard 
BADR1 is repaired andjhe board BADR1 is replaced onto 
the mother board 37,"the microprocessor 17 reads the sub 
data and parity data belonging to all error correction data 
groups of the other disc drives that belong to the same 
logical group as that of the normal alternate disc drive" 
replacing the faulty drive (drive SD#1 on the board BADR1 
in this example). Using the sub data and parity data thus 
read, the microprocessor 17 recovers all data in the faulty^ 
^disc drive. The recovered data is stored in the normal' 1 



address is assigned, and the board withdrawal flag_10fl^ 15< alternate disc drive replacing the faulty drive. 



corresponding to the board^arrying„that.disc_drive. If the, 
fault flag 29 regardiriglhedisc drive SD#1 is found to be of^ 
(i.e., set to 1) and if the board withdrawal flag 100 applicable^ 
to the board BADR1 carrying Jhat disc drive isbff (set to 0), 
the microprocessor 17 ^ recognizes two things:" tharthe disc 
drive in question has^ developed a fault, and "that the board 
BADR1 carrying" that drive is still held inserted^ in ^ie 
mother board 37.- 

Having recognized the faulty disc drive in the above 
manner, the microprocessor 17 recovers the data in the disc 
drive SD#1 on the board B ADR1 through the use of the data 
and parity data associated with theological group tqjwhich 
the drive in question belongs. Specifically, the microproces^ 
sor 17 retrieves a plurality of sub data and parity data from 
v 'the drives SD#1 on the other boards B ADR2-B ADR4. 
Using the retrieved sub data and parity data, the micropro- 
cessor 17 gets the internal parity generator 18 to recover the 
"sub data in the faulty disc drive SD#1. The recovered suli 
data is combined with the multiple sub data retrieved earlier 
so as to generate the requested data which is then sent to the 
CPU 1. It is by a known, technique that the parity generator 
18 recovers the data in the faulty disc drive^D#l? 

As described, the parity generator (PG) 18 generates 
parity data from sub data when target data is written to disc 
drives. When the sub data is recovered from both sub data 
and parity data, on the other hand, the parity generator 18 
acts as a data recovery circuit. 

If the board withdrawal flag 100 corresponding to the 
board BADR1 carrying the disc drive SD#1 is found to be 
on (set to 1), the microprocessor 17 recognizes that the board 
BADR1 has been withdrawn from the mother board 37. In 
that case, the microprocessor 17 is unable to read data from 
all disc drives, with or without fault, mounted on the board 
BADR1. Thus in the same manner as that of handling the, 
faulty drive SD#1 described above, the microprocessorJL7 50 
recovers data in the unsuccessfully accessed drive from the ' 
data in the other disc drives belonging to the same logical 
group. _ 

If the CPU 1 issues a data write request for updating'data 
in the disc drive SD#1, the microprocessor 17 checks the 
address table as in the case of the read request Following the 
procedure discussed in connection with the address transla- 
tion table, the microprocessor 17 translates the logical 
address designated by the CPU 1 into the address applicable 
within the disc array system. At this point, the micropro- 
cessor 17 checks the fault flag 29 corresponding to the disc 
drive in question and the board withdrawal flag 100 appli- 
cable to the board carrying that disc drive. 

If the fault flag 29 corresponrling to the disc drive SD#1 
is found to be on (set to 1) and if the board withdrawal flag 
-100 applicable to the board BADR1 carrying the disc drive 
SD#1 is off (set to 0), the microprocessor 17 recognizes that 
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Thereafter, the multiple write data" held in the cache \ 
memories 4 are written to the alternate disc driye^orxto the s 
other appropriate normal drives on the board B ADR1. The 
writing of these write data is carried out under the RAID 3 
scheme. That is, each write data item is divided into a 
plurality of sub data from which parity data is generated. 
One of the plurality of sub data is written to one disc drive 
on the board; the other sub data and the parity data generated 
above are written to a plurality of disc drives on the other 
boards. 

When the detached board BADR1 is replaced onto the 
mother board 37, the above-described scheme recovers all 
data in the normal drives on the replaced board B ADR1. The 
scheme also recovers the data for which no write request was 
made while the board BADR1 was withdrawn from the 
mother board 37. This scheme is effective where most of the 
data in the normal drives has been updated in response to 
numerous write requests issued while the board BADR1 was 
detached. However, the scheme takes time needlessly for 
recovery processing if it recovers the write data already held 
in the normal drives for which few write requests were 
issued while the board BADR1 was withdrawn from the 
mother board 37. 

This flaw is circumvented by not recovering the write (iaU? 
already held in the normal drive for which only a small 
number of write requests were issued while the board 
BADR1 was detached. Instead, only the data designated by ; 
the write requests from the upper device with the board 
detached are recovered. . - •- j 

If the board withdrawal flag 100 regarding the board 
BADR1 carrying the disc drive SD#1 is found to be on (set 
. to l),^he microprocessor 17 recognizes that the board 
jiADRl has been withdrawn from the mother board 37. In 
that case, the microprocessor 17 is unable to write data to all 
drives, with or without fault, on the board BADR1. Thus in 
the same manner as that of handling the faulty drive SD#1 
described above, the microprocessor 17 places in the cache, 
memories 4 the data to be written to that drive SD#L ,..__. 

Later, when the disc drive SD#1 on the board BADR1 is 
replaced by a normal alternate drive and the board B ADR1 
is mounted back onto the mother board 37, the micropro- 
cessor 17 reads the sub data and parity data belonging to all 
error correction data groups of the__other disc drives_that 
belong to the same'logical group asjhat-of- the^normal 
alternate disc drive replacing the faulty drive. Using the sub n 
data and parity data thus read, the microprocessor 17 recov- 
ers all data in the faulty disc drive and writes the recovered 
data to the alternate disc drive. -Thereafter, the write data 
held in the cache memories 4 is written to its appropriate 
destination. 

With the first embodiment, it is impossible to access all 
normal drives on the board BADR1 as long as that board is 
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detached from the mother board 37. While the board 
BADR1 carrying the faulty drive remains withdrawn from 
the mother board, it is necessary to write or read data to or 
from any of these normal disc drives in the same manner as 
with the faulty drive. However, : when a data write request is) 5 
issued to one of the normal drives on the detached board ano/ 
the board is later replaced onto the mother board 37, it is nop 
necessary to recover the data in the target normal drive as' 
has been the case with the normal drive having replacedthe 
faulty drive. This is because- the target normal drive noT 10 
having replaced the faulty one keeps intact its data held 
before the board was withdrawn. " 
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Because the newly generated write data is retained irTthe* 
cache memories 4, the write data generated while the boara* 
was being detached from the mother board 37 is easily 
^ written to any normal disc drive on the replaced board. 

Because the cache memories 4 hold the write^dataTa read 
request that may be issued later by the CPU 1 to read the 
retained data causes them to be read from the cache memo- 
ries 4 for direct transfer to the CPU 1. This effectively 
amounts to increasing the data read speed. 

As described, where a drive board carrying a faulty disc 
drive is withdrawn from the mother board, the first embodi- 
ment can continuously handle data read and write requests 
destined to the faulty disc drive. 
(First variation of the first embodiment) 

Where a board carrying a faulty disc drive (i.e., SD#1) is 
detached from the mother board, a data write request issued 
by the CPU 1 to write data to the faulty drive SD#1 (or to 
any other normal disc drive on the same board) may be 
processed alternatively as follows: 

The write data is divided into sub data and parity data is 
generated in conjunction therewith, in the same manner as 
when the board BADR1 is still attached to the mother board. 
When the board BADR1 is withdrawn from the mother 
board, the sub data or the newly generated parity data is 
written to the disc drives SD#1 or to other drives on the ■ 
boards BADR2-BADR4 other than the board BADR1 car-^ 
rying the faulty drive SD#1 (or other normal drives); the 
data-receiving disc drives here constitute the logical group 40 
to which the faulty drive SD#1 (or other normal drives) on 
the board BADR1 belongs. The sub data to be written to jhe 
.drive SD#1 or to other normal drives on the detached board 
BADR1 is discarded. 

Later, when the board BADR1 is again inserted irTthei 45 
mother board 37, data is recovered for all drives mounted on^ 
the replaced board BADR1 as in the case of the first 
embodiment. In this recovery process, the initial data in the 
faulty drive and the write data furnished to service the write 
request issued while the board is being detached are recov- 
ered inside the alternate drive replacing the faulty drive on 
the board in question. At the same time, it is also possible to 
recover in the normal disc drives on the same board the data 
held by these normal drives since before the occurrence of 
the fault as well as the write data generated in connection 
with the write request issued while the board was being 
detached. 

With the first variation of the first embodiment, it takes 
time to recover all data in the disc drives if the drives are 
numerous on the detached board. However, one advantage 
of the first variation is that there is no need to retain the new 
write data in the cache memories 4 as has been the case with 
the first embodiment This means that the cache memories 4 
may be reduced in size. 
(Second variation of the first embodiment) 

Suppose that, as in the case of the first variation of the first 
embodiment, the CPU1 issues a data write request while the 
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board BADR1 carrying the faulty drive is being ~wu^drawn 
from the mother board. In that case, the data is divided into 
a plurality of sub data and parity data is generated in 
conjunction therewith. Of these sub data, those other than 
the sub data destined to the disc drives on the board BADR1 
along with the generated parity data are written to a plurality 
of disc drives on different boards other than the board 
BADR1. The difference between the first and the second 
variation of- the first- embodiment ismaUwith_the~second 
variation, if one of the sub data is destined to any normal 
disc drive on the board BADR1, that sub data is retained in 
the cache memories 4. Later, when the board BADR1 is^ 
replaced onto the mother board, the second variation of the 
first embodiment recovers all data to be held in the alternate 
disc drive that replaces the faulty drive on the board BADR; 
the recovery of the data is achieved by use of the data in the 
other drives. Hie sub data in the cache memories 4 is written 
to the normal drives on the board BADR1 as in the case of 
the first embodiment. 

The second variation of the first embodiment requires a 
smaller quantity of sub data to be held in the cache memories 
4 than the first embodiment. When the board BADR1 is 
replaced onto the mother board, the second variation has no 
need for the recovery of data for the normal drives on the 
board BADR1 as carried out by the first variation. 
(Third variation of the first embodiment) 

Whereas the first embodiment uses the bus 37 as a 
high-speed bus for shared use by a plurality of drive boards, 
another alternative is to utilize a dedicated bus arrangement 
that connects each drive board to the array controller 2 on a 
one-to-one basis. Such a dedicated bus arrangement allows 
a plurality of disc drives 5 to operate parallelly on a 
concurrent basis. 

(Second embodiment) — 

The second embodiment is a modification of^the jirst! 
embodiment as it is supplemented by a spare board 50 A 
shown by broken line in FIG. 1 or 3. In FIG. 2B, a spare 
board connector is designated as 500A. The spare board 50A 
is identical in structure to the other drive boards. The spare 
board carries the same number of disc drives as each of the 
other boards. With the second embodiment, each spare drive 
corresponds to each of a plurality of logical groups consti- 
tuted by the drives on the other multiple boards. If any onei 
drive of a given logical group develops a fault, the spare 
drive corresponding to that group replaces the faultyjdriye. 
In the description that follows, any drive board other than 
the spare board will be assigned a number BADRi (i==l72> 
3 or 4) as in the first embodiment, and the spare board will 
be given a board number BADR5. 

With the spare drives furnished as outlined, the logical 
group table 21 in FIG. 4 for the first embodiment is replaced 
by the logical group table 21 in FIG. 7 for the second 
embodiment The logical group table 21 in FIG. 7 includes 
a data name column 26 for holding the names of the data to 
be stored in the spare drives. The address table 31 in FIG. 5 
for the first embodiment is replaced by the address table 31 
in FIG. 8 for the second embodiment The address table 31 
of FIG. 8 differs from that of FIG. 5 in that the total number 
of drives constituting each logical group is 5 including one 
spare drive. 

Described below are the workings of the second embodi- 
ment in effect when any disc drive develops a fault It is 
assumed that, as an example, the drive SD#1 on the board 
BADRI has developed a fault 
(A) System operation related to the faulty drive 
(Al) Where the board carrying the faulty drive is still 
inserted in the mother board 
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(A 1 a) Reading data from- the faulty "drive 
Suppose that, while the board BADRl^earrying the faulty 
drive is still inserted in the mother board 37, the CPU 1 has 
issued a data read request for reading data partially including 
the sub data held in the faulty drive. In that case, as with the 5 
first embodiment, the second embodiment recov ers th e .s ub 
data in the faulty drive from the sub data and parity datajn 
the other drives belonging to the same logical group as that 
of the faulty drive. In recovering the target sub data, the 
second embodiment reads the sub data from the drives SD#1 10 
on the boards BADR2 and BADR3 as well as the parity data 
from the drive SD#1 on the board BADR4. The recovered 
sub data is combined with the other two retrieved sub data 
for transfer to the CPU 1. 

The subsequent workings make the second embodiment 15 
different from the first. Specifically, the recovered sub dafa 
is written to the spare drive furnished for the logical group 
in question, i.e., to the drive SD#1 on the board BADRSih 
the above example. Thereafter, the spare drive takes oveP 
from the faulty drive in dealing with that particular sub data. ' 20 
That is, if the CPU 1 issues another data read request for " 
reading the data designated by the preceding data read 
request, or if the CPU 1 issues a data update request 
regarding the same data, the read or write operation is 
carried out on the four normal disc drives including the spare 25 
drive. 

Needless to say, address SADR1 of the spare drive SD#1 
on the spare board BADR5 is updated appropriately in the 
address table 31 A so that the spare drive SD#1 will take over 
from the faulty drive. 30 

As is clear from the description above, the use of spare 
drives allows the second embodiment to dispense with 
recovering sub data if the data once accessed is again 
accessed, whereas sub data recovery is required of the first 
embodiment. And unlike the first embodiment, the second 35 
embodiment in updating the data may execute the write 
request without retaining the write data in the cache memo- 
ries 4. 

(Alb) Write operation on the data in the faulty drive 

Suppose that, where a board carrying a faulty drive is 40 
mounted on the mother board 37, the CPU 1 issuesTdata 
write request for the data including the sub data TielHTfifSie 
faulty drive. In that case, unlike the first embodiment, the 
second embodiment executes the write request immediately. 

That is, as in the case of the normal write operation, the 45 
write data is divided into three sub data and parity data is 
generated in conjunction therewith. The difference from the 
normal write operation is that, of these sub data, the sub data" 
to be written to the faulty drive SD#1 on the board BADRt 
- is written to the drive SD#1 on the spare board BADR5. 50 

Needless to say, the address table 31A is updated appro- 
priately so that the spare drive SD#1 will take over from the 
faulty drive. 

(A2) System operation in effect when the board carrying 
a faulty drive is detached from the mother board 55 

In tins case, the operations for writing and reading data to 
and from the faulty drive are the same as the data read 
operation (Ala) and as the data write operation (Alb) with 
the faulty drive mounted on the mother board. 

(A3) System operation in effect when the board carrying 60 
a faulty drive is replaced onto the mother board 

Where the board B ADR1 with its faulty drive replaced by 
a normal drive is inserted back into the mother board 37, the 
second embodiment recovers the data in the initial faulty' 
drive in the same, manner as with the first embodiment The 65 
recovered data is written to the normal disc drive that has 
replaced the faulty drive. During recovery, there is no need 
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to use the data held in the spare drives. This scheme also 1 
makes it possible to recover the data written anew from the. 
CPU 1 after the fault occurred in the drive.' 

It may happen that a faulty drive oh the spare board 
BADR5 is replaced by another spare drive. In that case, the 
new spare drive will thereafter replace any of the other 
drives constituting the same logical group if the latter drive 
has developed a fault. 

(B) System operation related to the normal drives on the 
board carrying the faulty drive (Bl) Where the board having 
the faulty drive is mounted on the mother board 

As in the case of the first embodiment, the normal drives 
on the board carrying the faulty drive are used as they are 
continuously. 

(B2) System operation in effect when the board carrying 
the faulty drive is detached from the mother board 

In this case, the normal drives on the board carrying the 
^faulty drive cannot be accessedt^This makes it necessary to . 
carry out the operation described in (A I) above in connec- 
tion with the workings of the faulty drive. 

(B3) System operation in effect when the board carrying 
the faulty drive is replaced onto the mother board 

Of all the data held in any normal drive on the detached 
board carrying the faulty drive, the initial data in that normal 
drive to which a write request has been issued after with- 
drawal of the board from the mother board cannot be used. 
As mentioned earlier, the data generated in response to the 
write request and destined to be written to the normal drive 
is already written in the spare drive corresponding to that 
normal drive. When the board that had the faulty drive is 
replaced onto the mother board 37, the effective data held in 
the spare drive corresponding to the normal drive in question 
is moved to the now normal drive on the same board. This 
completes the recovery of the data for any normal drive on 
the detached board having carried the faulty drive. 

If the CPU 1 issues a read request regarding data in a 
normal drive on the currently detached board carrying the 
faulty drive, the data in that normal drive is recovered and 
placed in the spare drive corresponding to that normal drive, 
as described. Thus the data recovered in response to such a 
read request is also moved from the spare drive to the normal 
drive. It should be noted, however, that the data thus 
recovered in response to the read request is the same as the 
data held initially in the normal drive. Thus the above 
movement of the data leaves intact the data in the normal 
drive. 

Unlike the first embodiment, the second embodiment has 
spare drives that accommodate the data recovered in 
response to the read request issued by the CPU 1 while the 
drive board in question was being detached from the mother 
board 37. This means that if the CPU 1 later requests any of 
the data in the spare drives, the data is immediately avail- 
able. As opposed to the first embodiment that holds write 
data in the cache memories 4, the second embodiment 
accommodates write data in the spare drives. The second 
embodiment thus can retain much more write data than the 
first embodiment. 

(Variations of the second embodiment) 

A number of variations of the second embodiment dis- 
cussed below may be conceived in connection with the 
recovering of data after the board that had a faulty drive is 
replaced onto the mother board. 

(1) Variations of the second embodiment in connection 
with the recovery of data in the faulty drive 

(la) With the above-described second embodiment, all 
data to be retained in the drive having replaced the faulty 
drive is recovered by use of a plurality of sub data and parity 
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data in the other drives within the same logical group to 
which the faulty drive belongs. The data thus recovered is 
stored in the drive that has replaced the faulty drive. 

The spare drive that replaces the faulty drive already 
contains part of the data to be held in the faulty drive. Thus 5 
when the second embodiment is to recover data destined to 
the drive that has replaced the faulty data, the data other than 
that in the spare drive may be recovered and stored in the 
drive having replaced the faulty one; thV~~data "already 
retained in the spare drive is moved to the drive that took to 
over from the faulty drive. The scheme of this variation 
achieves data recovery more quickly than the second 
embodiment. This scheme is effective where the spare drive 
contains large quantities of data. 

(lb) With the second embodiment, when the board that 15 
had the faulty drive is replaced onto the mother board, the 
data destined to be held in the initial faulty drive is restored 
into the drive that has replaced the faulty drive. The spare 
drive used for the faulty drive is released anew as a spare 
drive. This scheme may be replaced by another alternative^ 20 
scheme whereby the data to be held in the faulty drive is 
recovered and written to the corresponding spare drive, the 
spare drive being used thereafter as a normal drive. In that 
case, of all data to be retained in the initial faulty drive, the 
data yet to be stored in the spare drive may selectively be' 25 
recovered and written to the latter drive. This procedure is 
preferred because it shortens the time required for data 
recovery. 

This alternative scheme is effective where a significant 
portion of the data to be held in the initial faulty drive is 30 
already retained in the spare drive. 

(lc) With the second embodiment, when the upper device 
issues a read request regarding data in the faulty drive, the 
issuance of the request causes the data to be recovered and 
stored into the corresponding spare drive. Alternatively, the 35 
process of storing the recovered data into the spare drive 
may be omitted. That is, the spare drive retains only the write 
data supplied by the upper device while the board carrying 
the faulty drive is being detached from the mother board. 
Obviously, after the board is replaced onto the mother board, 40 
the data in the faulty drive may be recovered and placed in 
the spare drive which in turn may be used in place of the 
faulty drive. 

(Id) Another alternative is that, if any drive develops a 
fault, the upper device may immediately inhibit access to the 45 
disc array in question. When all data in the faulty drive is 
recovered and stored in the corresponding spare drive, the 
spare drive may be used in place of the faulty drive. 

(2) Variations of the second embodiment in connection 
with normal drives on the board carrying a faulty drive 50 

(2a) With the second embodiment, after the board that had 
the faulty drive is replaced onto the mother board, the data 
in the spare drives corresponding to the normal drives on the 
same board is all moved back to the normal drives. It should 
be noted, however, that the data in the spare drives includes 55 
data portions recovered in response to read requests issued 
for data in a normal drive on the board carrying the faulty 
drive while that board was being detached. Such data is the 
same as the data which is already retained in the normal 
drive and which has been in effect since before the board 60 
was detached. Thus the movement of data from the spare 
drive to the normal drive in the second embodiment should 
preferably exclude the data recovered in response to such 
read requests. This scheme is desirable where the movement 
of data is to be accomplished more quickly. 65 

Tb implement the above scheme requires setting to the 
address table the information indicating, when data is writ- 
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ten to the spare drive, whether that data has been written in 
response to a read request. In accordance with that infor- 
mation, whether or not to move the data from the spare drive 
to the normal drive on the replaced board may be controlled. 
This scheme is effective where there exists more data written 
in the spare drive in response to write requests from the CPU 
1 than the data written thereto in response to read requests 
from the CPU 1. 

(2b) With the second embodiment, when the board that 
had the faulty drive is replaced onto the mother board, the 
data written in the spare drive is moved therefrom to the 
corresponding normal board. Alternatively, the data to be 
held in the replaced normal drive may be recovered and 
stored into the spare drive which in turn may be used 
thereafter as a normal drive. In that case, of all data to be 
held in the normal drive, the data yet to be retained in the 
spare drive is to be recovered selectively and then written to 
the spare drive. This scheme is effective where a consider- 
able portion of the data to be held in the normal drive is 
already retained in the spare drive. 
(Third embodiment) 

Whereas the first and the second embodiment are each a 
disc array system operating on the RAID 3 control prin- 
ciples, the third embodiment of the invention is a disc array 
system that adopts the RAID 5 scheme for its control^Tbus 
although the third embodiment has the same system" con- 
figuration as that in FIG. 1, the workings of the micropro- 
cessor 17 for control of the system differ from those for 
control of the first embodiment. In describing the third 
embodiment below, reference will be made to FIG. 1 and 
other circuit diagrams attached. The description of the third 
embodiment will emphasize its differences from the first 
embodiment. FIG. 9 is a view of a typical address table for 
use with the third embodiment. 
(System operation in the absence of faulty drives) 

The system operation of the third embodiment in the 
absence of faulty drives is primarily the same as that which 
takes place under the RAID 5 scheme. How the system 
works basically will be briefly described below for a better 
understanding of the RAID 5 scheme. 

In the RAID 3 setup, the write data corresponding to one 
write request is divided into a plurality of sub data from 
which parity data is generated. The multiple sub data and 
parity data thus generated are written separately to different 
disc drives. In the RAID 5 setup, by contrast, the data 
corresponding to one write request is not divided. A plurality 
of data items corresponding to a plurality of write requests 
are used to define parity data. The multiple data and the 
parity data generated in conjunction therewith constitute one 
parity group. The data and the parity data are written 
separately to different disc drives. However, it is not nec- 
essary that all data belonging to the same parity group be 
prepared along with their parity data before it is written to 
a plurality of disc drives. That is, every time a write request 
is issued by the CPU 1, the system generates new parity data 
based on the corresponding write data and on the already 
generated parity data (old parity data) regarding the parity 
group to which the write data in question belongs. The write 
data and the generated parity data are written separately to 
two disc drives. When the CPU 1 issues a read request, the 
data designated by that request is read from the single drive 
where the data is held; the retrieved data is sent to the CPU 
1. 

More specifically, a plurality of storage locations belong- 
ing separately to a plurality of disc drives are assigned to 
each parity group. When the CPU 1 issues a write request to 
the array disc controller 2, the logical address designated by 
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the write request is assigned a parity group and the storage 
location in any one drive allocated to that parity group. 
Where no data is held yet in the other storage locations 
allocated to the parity group in question, the remaining data 
of that parity group is regarded as consisting of a predeter- 5 
mined value (e.g., 0). Parity data is generated from the write 
data in question. Hie write data and the parity data thus 
generated are written separately to different disc drives. 

When the same parity group is assigned to a second write 
request, the previously generated parity data (old parity data) 1 0 
regarding the parity group is read out. New parity data is 
generated from the old parity data and from the second data 
designated by the second write request. The second write 
data is written to the storage location assigned to the data, 
while the new parity data replaces the old parity data. The is 
remaining data of the parity group is thereafter written in 
like manner. Thus the RAID 5 setup differs from the RAID 
3 setup in that every time data is written, new parity data is 
generated from the write data in question and from the old 
parity data of the parity group to which the data belongs. 20 
What is noticeable here is that the other data of the same 
group is not used in generating the new parity data. When 
currendy retained data is read from any one disc drive, the 
other data of the same parity group to which the written data 
belongs is not read out 25 
(System operation upon disc drive fault) 

The system operation of the third embodiment in case of 
a disc drive fault is primarily the same as that of the first 
embodiment with the exception of differences stemming 
from what sets the RAID 5 scheme apart from the RAID 3 30 
scheme. Illustratively, suppose that where a board having a 
faulty drive is mounted on the mother board, the system 
executes a write request from the CPU 1 for writing data to 
a normal drive. In this case, as in the case where no faulty 
drive exists, there is no need to divide the write data. 35 
Suppose also that while the board carrying the faulty drive 
is being detached from the mother board, the host device 
issues a write request to that faulty drive. In this case, the 
write data corresponding to the write request is accommo- 
dated temporarily in the cache memories 4. 40 

When the board having the faulty drive replaced by a 
normal alternate drive is inserted back into the mother board, 
the data to be held in the alternate drive rs, recovered and 
written thereto. Thereafter, the write data retained in the 
cache memories 4 is written without being divided to the > 45 
alternate drive. The same holds for the normal drives on the 
board having the faulty drive, except that there is no need to 
recover the data for these normal drives. 
(Variations of the third embodiment) 

The variations of the first embodiment may also be 50 
applied to the third embodiment. However, there need to be 
additional modifications attributable to the differences 
between the RAID 3 and the RAID 5 scheme. 

For example, suppose that with the first variation of the 
first embodiment, the CPU1 issues a write request to a faulty 55 
drive while the board having that faulty drive is being 
withdrawn from the mother board. In this case, as described, 
the write data is divided into a plurality of sub data and 
parity data is generated in conjunction therewith. Of these 
sub data, the sub data other than those to be written to the 60 
faulty drive as well as the newly generated parity data are 
written separately to a plurality of disc drives. A third 
embodiment variation corresponding to this first variation of 
the first embodiment has no need to divide the write data. 
Suppose that the upper device supplies write data while the 65 
board having the faulty drive is being detached from the 
mother board. In this case, of the multiple data of the parity 
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group to which the write data belongs, the data held in the 
drives other than the faulty drive as well as the parity data 
applicable to that parity group are read out. New parity data 
is generated from the data thus read out and from the write 
data supplied anew from the CPU 1. Then the newly 
generated parity data need only replace the old parity data. 

The third embodiment may also be applied to RAID 4 
array controllers. 
(Fourth embodiment) 

The fourth embodiment of this invention is the disc array 
system of the second embodiment adapted to operate on the 
RAID 5 control principles. Thus the fourth embodiment, as 
with the second embodiment, is implemented using the same 
system constitution as that in FIG. 1 supplemented by the 
spare drive board S0A. What makes the fourth embodiment 
different from the second embodiment is the manner in 
which the system is controlled by the microprocessor 17. In 
describing the fourth embodiment below, reference will be 
made to FIG. 1 and other circuit diagrams attached. The 
description of the fourth embodiment will emphasize its 
differences from the second and the third embodiment. 
(System operation in the absence of faulty drives) 

The system operation of the fourth embodiment in the 
absence of faulty drives is primarily the same as that of the 
second embodiment with the exception of differences stem- 
ming from what sets the RAID 5 scheme apart from the 
RAID 3 scheme. These differences have already been 
described in connection with what makes the third embodi- 
ment different from the first embodiment. 
(System operation upon disc drive fault) 

The system operation of the fourth embodiment in case of 
a disc drive fault is primarily the same as that of the second 
embodiment with the exception of differences stemming 
from what sets the RAID 5 scheme apart from the RAID 3 
scheme. These differences have already been described in 
connection with what makes the third embodiment different 
from the first embodiment. 
(Variations of the fourth embodiment) 

The variations of the second embodiment may also be 
applied to the fourth embodiment. However, there need to be 
additional modifications attributable to the differences 
between the RAID 3 and the RAID S scheme. 

For example, where the CPU1 issues a write request to a 
faulty drive and the request is serviced using a spare drive, 
there is no need to divide the write data as in the case where 
no faulty drives exist. That is, of the multiple data of the 
parity group to which the write data belongs, the data held 
in the drives other than the faulty drive as well as the parity 
data applicable to that parity group are read out New parity 
data is generated from the data thus read out and from the 
write data supplied anew from the CPU 1. The write data is 
written to the spare drive, and the newly generated parity 
data replaces the old parity data. 

The fourth embodiment may also be applied to RAID 4 
array controllers. 
(Fifth embodiment) 

■Unlike the embodiments described above, the fifth 
embodiment is one which applies where a board connector 
has developed a fault while all drives remain normal. 

In case of a poor contact or other contact-related failure 
between the mother board and the connector of any drive 
board, all disc drives on that board become inaccessible. In 
that case, it is necessary to withdraw the board in question 
from the mother board so that the board will be replaced by 
a normal board. The fifth embodiment is in fact any of the 
preceding embodiments applied in that eventuality, 

When the user detects a faulty connector of any board, the 
user gets the CPU 1 to issue a command notifying the array 
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controller 2 of the impending withdrawal of the board. The 
user then withdraws the board in question from the mother 
board. Thereafter, the system operation is the same as that of 
any of the preceding embodiments which may be in use. 

A poor connector contact of any drive board may be 
detected as follows: when any one drive on a given drive 
board currently mounted on the mother board is accessed 
unsuccessfully, the array controller 2 sets in the address table 
a flag indicating a fault of that drive. If other drives on the 
same board are found consecutively to be inaccessible, there 
is a high possibility of a fault other than drive-related faults 
in the individual drives. This should prompt the user to 
investigate what went wrong. When the user detects a poor 
connector contact of the board, the user inputs the command 
mentioned above. If there have been faulty drives detected 
prior to the check on faulty connector contact, the user enters 
commands for turning off the fault bits corresponding to 
such drives. In response to these commands, the array 
controller 2 resets the fault bits corresponding to the faulty 
drives on the designated board. 

As described, the disc array system according to the 
invention operates continuously even as the board having 
faulty drives is being detached from the mother board in 
order for such drives to be replaced by normal alternate 
drives. 

In case of a connector fault of any drive board, it is 
possible to execute data read and write requests designating 
data in any drive on the board in question. 

As many apparently different embodiments of this inven- 
tion may be made without departing from the spirit and 
scope thereof, it is to be understood that the invention is not 
limited to the specific embodiments thereof except as 
defined in the appended claims. 

What is claimed is: 

1. A disc array system, comprising: 

(a) a group of disc storage devices divided into a plurality 
of logical groups, each logical group comprising a 
plurality of disc storage devices for holding groups of 
error correction data, each error correction data group 
comprising a plurality of data and an error correction 
code for the plurality of data 

(b) a plurality of local boards each for holding a number 
of disc storage devices within the group of disc storage 
devices; 

(c) an array controller connected to the group of disc 
storage devices in such a manner that each local board 
can be disconnected from the array controller and can 
be connected to the array controller again thereafter, 
said array controller executing read requests and write 
requests provided by an upper device; 
(cl) each write request being executed in such a 

manner that (ell) an error correction code is gener- 
ated which should belong to an error correction data 
group to which write data attached to said each write 
request belongs, and that (c!2) the write data and the 
generated error correction code are written into plu- 
ral disc storage devices within the group of disc 
storage devices as data belonging to one error cor- 
rection data group; 
(c2) each read request being executed in such a manner 60 
that (c21) data requested by said each read request is 
read from at least one of the group of disc storage 
devices in case the one disc storage device is not 
faulty and that (c22) the requested data is recovered 
in case the one disc storage device is faulty, based 
upon one error correction code and plural other data 
which belong to one error correction data group to 
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which said requested data belongs, and are held in 
other disc storage devices belonging to one logical 
group to which said one faulty disc storage device 
belongs; 

wherein said group of disc storage devices are held on 
said plurality of local boards in such a manner that 
different sub-groups of disc storage devices belonging 
to a same logical group are held on mutually different 
ones of said plurality of local boards. 

2. A disc array device according to claim 1, further 
comprising: a common board for holding the plurality of 
local boards in such a manner that each local board can be 
withdrawn from the common board and can be inserted 
again into the common board thereafter; 

wherein said plurality of local boards is held by the 
common board in a substantially perpendicular position 
thereto and substantially in parallel to each other local 
board; 

wherein said array controller is connected to the group of 
disc storage device by way of signal lines provided on 
the common board. 

3. A disc array device according to claim 2, wherein plural 
disc storage devices mounted on each local board belong to 
mutually different logical groups. 

4. A disc array device according to claim 2: 

wherein said array controller accepts a read request which 
is supplied by said upper device after withdrawal of one 
of said local boards from said common board, and 
which requests readout of one data held on one faulty 
disc storage device held on said one local board; 

wherein said array controller executes an accepted said 
read request, in such a manner that said one data is 
recovered based upon plural other data held in other 
disc storage devices held on local boards other than 
said one local board, and that said recovered one data 
is supplied to said upper device; 

wherein said array controller accepts another read request 
which is supplied by said upper device after said 
withdrawal and which requests readout of another data 
held in another normal disc storage device held on said 
one local board; 

wherein said array controller executes said accepted 
another request, in such a manner that said another data 
is recovered based upon plural further other data held 
in further other disc storage devices held on local 
boards other than said one local board, and that said 
recovered another data is supplied to said upper device. 

5. A disc array device according to claim 4, further 
comprising a random access memory; 

wherein said array controller further accepts write 
requests each of which is supplied after said withdrawal 
and requires write data to be written into either one of 
said faulty disc storage device and said another normal 
disc storage device held on said one local board; 

wherein said array controller temporarily stores write data 
requested by each of said accepted write requests into 
said random access memory; 

wherein, after said one faulty disc storage device has been 
replaced by a normal replacing disc storage device and 
the one local board is inserted to said common board 
again, said array controller then executes said write 
requests to the replaced said one local board by using 
plural write data held in said random access memory. 

6. A disc array device according to claim 5, wherein said 
array controller recovers plural data which were held in said 
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one faulty disc storage device, and writes said recovered 
plural data into said replacing disc storage device, after 
insertion of said one local board and before execution of said 
write requests accepted after disconnection of said one local 
board from said common board; 3 
wherein recovering of said plural data is executed based 
upon plural other data held in disc storage devices held 
on local boards other than said one local board; 
wherein said other disc storage device are ones other than 
said one faulty disc storage device among disc storage 10 
devices belonging to one logical group to which said 
one faulty disc storage device belongs; 
wherein said plural other data comprise plural error 
correction codes belonging to error correction data 
groups to which said plural data held in said one faulty 15 
disc storage device belong and plural data other than 
said plural data held in said one faulty disc storage 
device and said plural error correction codes among 
plural data belonging to those plural correction data 
groups. 20 

7. A disc array device according to claim 4, 

wherein said array controller further accepts write 
requests each of which is supplied after said withdrawal 
of said one local board and requires write data to be 
written into either one of said faulty disc storage device 25 
and another normal disc storage device held on said one 
local board; 

wherein said array controller executes each of said 
accepted write requests before said one faulty disc 
storage device is replaced by a normal replacing disc 30 
storage device and said one withdrawn local board is 
inserted to said common board again, in such a manner 
that an error correction code is generated for write data 
attached to said each write request, and is written into 
other one disc storage device held on one of said 35 
plurality of local board other than said one local board; 

wherein said other one disc storage device is one belong- 
ing to an error correction data group to which one of 
said faulty disc storage device and said another normal ^ 
disc storage device into which said each write data is to 
be written belongs; 

wherein said array controller recovers plural data to be 
held in each of said replacing disc storage device and 
said another normal disc storage devices based upon 45 
plural other data held in other disc storage devices held 
on other local boards other than said one local board, 
and writes said plural data recovered for each of said 
replacing disc storage device and said another normal 
disc storage device therein. 50 

8. A disc array device according to claim 7, wherein 
recovering of plural data for said another normal disc 
storage device is executed in such a manner that plural data 
supplied by said upper device to be written into said another 
normal disc storage device after said withdrawal of said one 55 
local board are recovered but plural data held by said another 
normal disc storage device before said withdrawal are not 
recovered. 

9. A disc array device according to claim 4, further 
comprising at least one spare disc storage device connected $0 
to said common board; 

wherein said array controller accepts write requests each 
of which is supplied by said upper device after said 
withdrawal of said one local board and requires write 
data to be written into one of said one faulty disc 65 
storage device and another normal disc storage devices 
held on said one local disc storage device: 
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wherein said array controller executes each accepted write 
request in such a manner that write data required by 
each accepted write request is written into said spare 
disc storage device; 

wherein said array controller executes a read request 
supplied by said upper device after withdrawal of said 
one local board, so that data required by the read 
request is read out from said spare disc storage device, 
in case said required data is held in said spare disc 
storage device. 

10. A disc array device according to claim 9, 
wherein said array controller writes data recovered as a 

result of execution of a preceding read request supplied 
by said upper device after said withdrawal of said one 
local board, into said spare disc storage device, in case 
said preceding read request requests data held in one of 
said one faulty disc storage device and said another 
normal disc storage devices held on said one withdrawn 
board; 

wherein said array controller reads said recovered data 
written into said spare disc storage device, in case a 
subsequent read request supplied by said upper device 
requires read out of same data as the data required by 
said preceding read request. 

11. A disc array device according to claim 9, 
wherein said array controller transfers part of plural data 

held in said spare disc storage device into said another 
normal disc storage devices held on said one local 
board, after said one faulty disc storage device is 
replaced by a normal replacing disc storage device and 
said one local board is inserted to said common board 
again, said part of plural data including plural data 
which were required to be written into said another 
normal disc storage device by write requests supplied 
by said upper device after withdrawal of said one local 
board. 

12. A disc array device according to claim 11, 
wherein, after replacing of said one faulty disc storage 

device and said insertion of said one withdrawn local 
board, said array controller recovers plural data to be 
held in said replacing disc storage device, based on 
plural data held in plural disc storage devices held on 
plural local boards other than said one local board, and 
writes said recovered plural data into said replacing 
disc storage device, said plural data recovered includ- 
ing plural data held by said faulty disc storage device 
before said faulty disc storage device has become faulty 
and plural data supplied by said upper device as data to 
be written into said faulty disc storage device after said 
withdrawal of said one local board. 

13. A disc array device according to claim 4, further 
comprising a plurality of spare disc storage devices; 

wherein said array controller selects one of said plurality 
of spare disc storage devices for each of said faulty disc 
storage devices and said normal disc storage devices 
held on said one board, at withdrawal of said one local 
board; 

wherein said array controller accepts a write request 
which is supplied by said upper device after withdrawal 
of said one local board and which requires write data to 
be rewritten into one of said faulty disc storage device 
and said another normal disc storage devices; 

wherein said array controller executes said accepted write 
request in such a manner that said write data is written 
into one of said plurality of said spare disc storage 
devices selected for said one disc storage device; 
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wherein said array controller accepts a read request sub- 
sequently supplied by said upper device, so that data 
requested by said read request is read out from one of 
said plurality of spare disc storage devices, in case said 
requested data is already held therein. 5 

14. A disc array device according to claim 13, further 
comprising at least one spare local board held on said 
common board so that said spare board can be withdrawn 
from said common board and can be inserted into said 
common board again; 

wherein said plurality of spare disc storage devices are 
held on said spare board, and is connected to said array 
controller by way of the signal line provided on said 
common board. 

15. A disc array device according to claim 13, 
wherein said array controller writes data recovered as a 

result of execution of a preceding read request supplied 
by said upper device after said withdrawal of said one 
local board, into one of said plurality of spare disc 
storage devices selected for said one of said one faulty 
disc storage device and said another normal disc stor- 20 
age device, in case said preceding read request requests 
data held therein; 
wherein said array controller reads said recovered data 
written into said one spare disc storage device, in case 
a subsequent read request supplied by said upper device 25 
requires read out of same data as the data required by 
said preceding read request. 

16. A disc array device according to claim 13, 
wherein said array controller transfers part of plural data 

held in one of said plurality of spare disc storage 30 
devices selected for said another normal disc storage 
device held on said one local board into said another 
normal disc storage devices, after said one faulty disc 
storage device is replaced by a normal replacing disc 
storage device and said one local board is inserted to 35 
said common board again, said part of plural data 
including plural data which were required to be written 
into said another normal disc storage device by write 
requests supplied by said upper device after withdrawal 
of said one local board. 40 

17. A disc array device awarding to claim 16, 
wherein, after said one faulty drive has become faulty, 

said array controller recovers plural data which was 
held by said one faulty disc storage device before said 45 
one board has become faulty, and transfers said recov- 
ered plural data to one of said plurality of spare disc 
storage devices selected for said one faulty disc storage 
device; 

wherein said array controller executes read requests and 50 
write requests supplied by said upper device after said 
one faulty disc storage device has become faulty so that 
the read requests and the write requests are executed by 
using said one spare disc storage device selected for 
said one faulty disc storage device, in case said read 55 
requests and said write requests designates data held in 
said one faulty disc storage device. 

18. A disc array device according to claim 17, 
wherein, after said replacing of said one faulty disc 

storage device and said insertion of said one local 60 
board, said array controller keeps using said one spare 
disc storage device selected for said one faulty disc 
storage device, in place of said one faulty disc storage 
device, and uses said replacing disc storage device as 
one of said plurality of spare disc storage devices. 65 

19. A disc array device according to claim 18, further 
comprising at least one spare local board held on said 
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common board so that said spare board can be withdrawn 
from said common board and can be inserted into said 
common board again; 
wherein said plurality of spare disc storage devices are 
held on said spare board, and is connected to said array 
controller by way of the signal line provided on said 
common board. 

20. A disc array device according to claim 17, 
wherein, after replacing of said one faulty disc storage 

device and said inserting of said one local board, said 
array controller keeps using said one spare disc storage 
device selected for said one faulty disc storage device 
in place of said one faulty disc storage device, and uses 
said replacing disc storage device as one of said plu- 
rality of spare disc storage devices. . 

21. A disc array device according to claim 20, further 
comprising at least one spare local board held on said 
common board so that said spare board can be withdrawn 
from said common board and can be inserted into said 
common board again; 

wherein said plurality of spare disc storage devices are 
held on said spare board, and is connected to said array 
controller by way of the signal line provided on said 
common board. 

22. A disc array device according to claim 16, 
wherein, after withdrawal of said one local board, said 

array controller recovers plural data which was held by 
said one faulty disc storage device before said one 
board has become faulty, and writes said recovered 
plural data into one of said plurality of spare disc 
storage devices selected for said one faulty disc storage 
device; 

wherein said array controller executes read requests and 
write requests supplied by said upper device after said 
one faulty disc storage device has become faulty, so 
that the read requests and the write requests are 
executed by using said one spare disc storage device 
selected for said one faulty disc storage device, in case 
said read requests and said write requests designates 
data held in said one faulty disc storage device. 

23. A disc array device according to claim 13, 
wherein, after replacing of said one faulty disc storage 

device and said insertion of said one withdrawn local 
board, said array controller recovers plural data to be 
held in said replacing disc storage device, based on 
plural data held in plural disc storage devices held on 
plural local boards other than said one local board. 

24. A disc array device according to claim 13, 
wherein, after said replacing of said one faulty disc 

storage device and said inserting of said one withdrawn 
local board, said array controller recovers plural data 
which was held by said one faulty disc storage device 
before said one board has been withdrawn from said 
common board, and writes said recovered plural data 
into said replacing disc storage device; 

wherein said array controller further transfers plural data 
supplied by said upper device after said one faulty disc 
storage device has become faulty from one of said 
plurality of spare disc storage devices selected for said 
one faulty disc storage device, to said replacing disc 
storage device; 

wherein said recovering is executed based on plural data 
held in plural disc storage devices held on plural local 
boards other than said one local board. 

25. A disc array device according to claim 1, wherein 
plural disc storage devices mounted on each local board 
belong to mutually different logical groups. 
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26. A disc array device according to claim 1, 
wherein said array controller executes each write request 

supplied by said upper device, in such a manner that 
write data required by said each write request is divided 
into a plurality of sub-data, generates an error correct- 5 
ing code for said plurality of sub-data, and write said 
plurality of sub-data and said generated error correction 
code into plural disc storage devices held on mutually 
different local boards, as data belonging to one error 
correction data group; 10 
wherein said array controller executes each read request 
supplied by said upper device, in such a manner that a 
plurality of sub-data which form data requested by said 
read request are read from plural disc storage devices 
held on mutually different local boards, said plurality of 15 
sub-data combined into one data, and said one data is 
supplied to said upper device. 

27. A disc array device according to claim 1, 
wherein an error correction data group comprises a plu- 
rality of write data required by a plurality of write ^ 
requests supplied by said upper device and an error 
correction code for said plurality of write data; 

wherein said array controller executes each write request 
supplied by said upper device, in such a manner that 
said old write data to be updated by write data attached ^ 
to said each write request is read and an old error 
correction code belonging to an error correction data 
group to which said write data belongs, a new error 
correcting code for said error correction data group is 
generated by said write data, said old error correcting 
code and said new error correcting code, and said write 30 
data and said generated new error correction code are 
written into plural disc storage devices held on mutu- 
ally different local boards, as data belonging to said 
error correction data group; 

wherein said array controller executes each read request 35 
supplied by said upper device, in such a manner that 
data requested by said read request is read from one of 
disc storage devices held on one of said plurality of 
local boards, and is supplied to said upper device. 

28. A disc array device according to claim 1: 40 
wherein said array controller accepts one read request 

which is supplied by said upper device after withdrawal 
of one of said local boards from said common board 
and which requests readout of one data held on one 
faulty disc storage device held on said one local board; 45 

wherein said array controller executes said accepted one 
read request, in such a manner that said one data is 
recovered based upon plural other data held in other 
disc storage devices held on local boards other than 
said one local board, and that said recovered one data 50 
is supplied to said upper device; 

wherein said array controller accepts another read request 
which is supplied by said upper device after said 
withdrawal and which requests readout of another data 55 
held in another normal disc storage device held on said 
one local board; 

wherein said array controller executes said accepted 

. another request, in such a manner that said another data 
is recovered based upon plural further other data held go 
in further other disc storage devices held on local 
boards other than said one local board, and that said 
recovered another data is supplied to said upper device. 

29. A disc array device according to claim 28, further 
comprising a random access memory; gg 

wherein said array controller further accepts write 
requests each of which is supplied after said withdrawn 
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and requires write data to be written into either one of 
said faulty disc storage device and said another normal 
disc storage device held on said one local board; 

wherein said array controller stores write data requested 
by each of said accepted write requests; 

wherein said array controller executes said write requests 
by using plural write data requested thereby and held in 
said memory, after said one faulty disc storage device 
has been replaced by a normal replacing disc storage 
device and the one local board is inserted to said 
common board again. 

30. A disc array device according to claim 28, further 
comprising a plurality of spare disc storage devices; 

wherein said array controller selects one of said plurality 
of spare disc storage devices for each of said faulty disc 
storage devices and said normal disc storage devices 
held on said one board, at withdrawal of said one local 
board; 

wherein said array controller accepts a write request 
which is supplied by said upper device after withdrawal 
of said one local board and which requires write data to 
be rewritten into one of said faulty disc storage device 
and said another normal disc storage devices; 

wherein said array controller executes said accepted write 
request in such a manner that said write data is written 
into one of said plurality of said spare disc storage 
devices selected for said one disc storage device; 

wherein said array controller accepts a read request sub- 
sequently supplied by said upper device, so that data 
requested by said read request is read out from one of 
said plurality of spare disc storage devices, in case said 
requested data is already held therein. 

31. A disc array system according to claim 1, wherein 
each sub-group of said different sub-groups comprises only 
one disc storage device, such that each disc storage device 
belonging to a same logical group is held on a local board 
which is different from other local boards holding other disc 
storage devices in the same logical group, 

32. A disc array system according to claim 1, wherein 
each sub-group of said different sub-groups comprises a 
pi^eterrnined number greater than one of disc storage 
devices, such that each sub-group of said predetermined 
number of disc storage devices is held on a local board 
which is different from other local boards holding other 
sub-groups in the same logical group. 

33. A disc array system according to claim 1, wherein a 
total number of disc storage devices of each sub-group of 
each logical group, held on a same local board does not 
exceed a maximum number of disc storage devices whose 
data can be recovered by using data and error correction data 
held by disc storage devices belonging to sub-groups of disc 
storage discs of said same logical group other than said each 
sub-group of disc storage devices held on said same local 
board. 

34. In a disk array device, having: 

a group of disc storage devices divided into a plurality of 
logical groups, each logical group comprising a plural- 
ity of disc storage devices for holding error correction 
data groups, each error correction data group compris- 
ing a plurality of data and error correction data for the 
plurality of data; 

a plurality of local boards each for holding a number of 
disc storage devices within the group of disc storage 
devices; and 

an array controller connected to the group of disc storage 
devices in such a manner that each local board can be 
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disconnected from the array controller and can be 
connected to the array controller again thereafter, said 
array controller executing read requests and write 
requests provided by an upper device, a disc array 
access method, comprising the steps of; 

(a) determining plural disc storage device which should 
belong each logical group, in such a manner that said 
plural disc storage device are held on mutually 
different ones of said plurality of local boards; 

(b) disconnecting one of said plurality of local boards 
which holds one faulty disc storage device, from said 
array controller at occurrence of a fault to said one 
disc storage device; 

(c) keeping accepting read requests each of which is 
supplied by an upper device after the disconnecting 
and requires readout of data held in one of said one 
faulty disc storage device and another normal storage 
device held on said one local board; and 

(d) executing each read request, in such a manner that 
read data required by said each read request is 
recovered, based upon plural data held on plural disc 
storage devices held on plural local boards other than 
said one disconnected board, and that said recovered 
data is supplied to said upper device. 

35. A disc array access method according to claim 34, 
wherein said disc array device further comprises a random 

access memory; 

wherein said method further comprises the steps of: 

(e) keeping accepting write requests each of which is 
supplied by an upper device after the disconnecting 
of said local board and requires write data to be 
written into one of said faulty disc storage device and 
said another normal disc storage devices held on said 
one local board; 

(f) temporally holding plural write data required by said 
accepted write requests in said random access 
memory; and 

(g) executing, after said one faulty disc storage device 
is replaced by a replacing normal disc storage device 
and the one local board is connected to said array 
controller again, said plural write requests to said 
replacing normal disc storage device by using said 
plural write data held in said random access memory. 

36. A disc array access method according to claim 34, 
wherein said disc array device further comprises at least 45 

one spare disc storage device connected to said array 
controller; 

wherein said method further comprises the steps of: 

(e) keeping receiving write requests each of which is 
supplied by an upper device after the disconnecting 50 
of said local board and requires write , data to be 
written into one of said faulty disc storage device and 
said another normal disc storage devices held on said 
one local board; 

(f) executing each accepted write request in such a 
manner that write data requested by said each write 
request is written into said spare disc storage device; 
and 

(g) after said one faulty disc storage device has been 
replaced by a normal replacing disc storage device 
and the one local board is connected to said array 
controller again, transferring at least part of plural 
write data held in said spare disc storage device to 
said another normal disc storage device held on the 
one local board, said part of said plural write data 
being ones given by said upper device as ones to be 
written into said another normal disc storage devices. 
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37. A disc array access method according to claim 34, 
wherein said disc array device further comprises a plu- 
rality of spare disc storage devices connected to said 
array controller, 

wherein said method further comprises the steps of: 

(e) selecting one of said plurality of spare disc storage 
devices for each of said one faulty disc storage 
device and said another normal disc storage device 
held on said one local board at disconnecting of said 
one local board; 

(f) keeping receiving write requests each of which is 
supplied by an upper device after the disconnecting 
and requires write data to be written into one of said 
faulty disc storage device and said another normal 
disc storage devices held on said one local board; 

(g) recovering at said disconnecting of said one local 
board, plural data which were already held in said 
one faulty disc storage device before said one faulty 
disc storage device has become faulty, based upon 
plural data held on disc storage devices held on 
plural local boards other than said one local board; 

(h) writing said recovered plural data into one of said 
plurality of spare disc storage devices selected for 
said one faulty disc storage device; 

(i) executing said accepted write requests in such a 
manner that write data required by each of said 
accepted write request is written into one of said 
spare disc storage devices selected for one of said 
one faulty disc storage device and said another 
normal disc storage device into which said required 
write data is to be written; 

(j) keeping using said one spare disc storage device 
selected for said one faulty disc storage device in 
place of said one faulty disc storage device after said 
one faulty disc storage device has been replaced by 
a normal replacing disc storage device and the one 
local board is connected to said array controller 
again; and 

(k) using said replacing disc storage device as a new 
spare disc storage device. 

38. A disc array access method according to claim 37, 
further comprising the steps of; 

0) after said connecting again, transferring plural write 
data held in another one of said plurality of spare disc 
storage devices selected for said another normal disc 
storage device to said another normal disc storage 
device held on the one local board; and 

(ra) releasing said another spare device as a new spare 
device. 

39. A disk array system comprising: 

a group of disc storage devices divided into a plurality of 
logical groups, each logical group comprising a plural-, 
ity of disc storage devices for storing data including 
error correction data; 

a plurality of local boards each for holding a predeter- 
mined number of disc storage devices of said group of 
disc storage devices; 

an array controller connected to the group of disc storage 
devices in such a manner that each local board can be 
disconnected from the array controller and can be 
connected to the array controller again thereafter, said 
array controller executing read request and write 
requests provided by an upper device; 

wherein said group of disc storage devices are held on 
said plurality of local boards in such a manner that 
different sub-groups of disc storage devices belonging 
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to a same logical group are held on mutually different 
ones of said plurality of local boards, such that, when 
one disc storage device of an affected logical group 
becomes faulty and an affected local board is discon- 
nected from said array controller, said disk array sys- 5 
tern can continue to operate by reconstructing data 
contained on the faulty said one disc storage device 
using data from disc storage devices of said affected 
logical group, which are held on unaffected, different 
ones of said plurality of local boards, 10 
40. A disc array system, comprising: 

(a) a group of disc storage devices divided into a plurality 
of logical groups, each logical group comprising a 
plurality of disc storage devices for holding groups of 
error correction data, each error correction data group 15 
comprising a plurality of data and an error correction 
code for the plurality of data; 

(b) a plurality of local boards each for holding a number 
of disc storage devices within the group of disc storage 
devices; 20 

(c) an array controller connected to the group of disc 
storage devices in such a manner that each local board 
can be disconnected from the array controller and can 

be connected to the array controller again thereafter, ^ 

said array controller executing read requests and write 

requests provided by an upper device; 

(cl) each write request being executed in such a 
manner that (ell) an error correction code is gener- 
ated which should belong to an error correction data 30 
group to which write data attached to said each write 
request belongs, and that (cl2) the write data and the 
generated error correction code are written into plu- 
ral disc storage devices within the group of disc 
storage devices as data belonging to one error cor- 35 
rection data group; 

(c2) each read request being executed in such a manner 
that (c21) data requested by said each read request is 
read from at least one of the group of disc storage 
devices in case the one disc storage device is not ^ 
faulty and that (c22) the requested data is recovered 
in case the one disc storage device is faulty, based 
upon one error correction code and plural other data 
which belong to one error correction data group to 
which said requested data belongs, and are held in 45 
other disc storage devices belonging to one logical 
group to which said one faulty disc storage device 
belongs; 

wherein said group of disc storage devices are held on 
said plurality of local boards in such a manner that disc 50 
storage devices belonging to a same logical group arc 
held on mutually different ones of said plurality of local 
hoards. 



30 

41. A disc array system, comprising: 

(a) a group of disc storage devices divided into a plurality 
of logical groups, each logical group comprising a 
plurality of disc storage devices for holding groups of 
error correction data, each error correction data group 
comprising a plurality of data and an error correction 
code for the plurality of data; 

(b) a plurality of local boards each for holding a number 
of disc storage devices within the group of disc storage 
devices; 

(c) an array controller connected to the group of disc 
storage devices in such a manner that each local board 
can be disconnected from the array controller and can 
be connected to the array controller again thereafter, 
said array controller executing read requests and write 
requests provided by an upper device; 

(cl) each write request being executed in such a 
manner that (ell) an error correction code is gener- 
ated which should belong to an error correction data 
group to which write data attached to said each write 
request belongs, and that (c21) the write data and the 
generated error correction code are written into plu- 
ral disc storage devices within the group of disc 
storage devices as data belonging to one error cor- 
rection data group; 

(c2) each read request being executed in such a manner 
that (cl 1) data requested by said each read request is 
read from at least one of the group of disc storage 
devices in case the one disc storage device is not 
faulty and that (c22) the requested data is recovered 
in case the one disc storage device is faulty, based 
upon one error correction code and plural other data 
which belong to one error correction data group to 
which said requested data belongs, and are held in 
other disc storage devices belonging to one logical 
group to which said one faulty disc storage device 
belongs; 

wherein said group of disc storage devices are held on 
said plurality of local boards in such a manner that disc 
storage devices belonging to a same logical group are 
held on plural ones of said plurality of local boards. 

42. A disc array system according to claim 41, wherein a 
total number of disc storage devices which are held on a 
same local board and belong to a same logical group does 
not exceed a maximum number of disc storage devices 
whose data can be recovered by using data and an error 
correction code held by disc storage devices belonging to 
said same logical group other than said disc storage devices 
which are held on said same local board and belong to said 
same logical group. 

* * * * * 
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